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Abstract 

This paper concerns the self-assembly of scaled-up versions of arbitrary finite shapes. We work in the 
multiple temperature model that was introduced by Aggarwal, Cheng, Goldwasser, Kao, and Schweller 
(Complexities for Generalized Models of Self-Assembly, SODA 2004). The multiple temperature model is 
a natural generalization of Winfree's abstract tile assembly model, where the temperature of a tile system 
is allowed to be shifted up and down as self-assembly proceeds. We first exhibit two constant-size tile 
sets in which scaled-up versions of arbitrary shapes self-assemble. Our first tile set has the property that 
each scaled shape self-assembles via an asymptotically "Kolmogorov-optimum" temperature sequence 
but the scaling factor grows with the size of the shape being assembled. In contrast, our second tile set 
assembles each scaled shape via a temperature sequence whose length is proportional to the number of 
points in the shape but the scaling factor is a constant independent of the shape being assembled. We 
then show that there is no constant-size tile set that can uniquely assemble an arbitrary (non-scaled, 
connected) shape in the multiple temperature model, i.e., the scaling is necessary for self-assembly. This 
answers an open question of Kao and Schweller (Reducing Tile Complexity for Self-Assembly Through 
Temperature Programming, SODA 2006), who asked whether such a tile set existed. 



*This research was supported in part by National Science Foundation Grants 0652569 and 0728806, and by NSF-IGERT 
Training Project in Computational Molecular Biology Grant number DGE-0504304 
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1 Introduction 



Self-assembly is a process by which a small number of fundamental components automatically coalesce to 
form a target structure. In 1998, Winfree [21] introduced the abstract Tile Assembly Model (aTAM) as 
an over-simplified discrete mathematical model of the DNA tile self-assembly pioneered by Seeman [17]. 
The aTAM is an "cffcctivization" of classical Wang tiling [19,20] in which the fundamental components are 
un-rotatable, but translatable square "tile types" whose sides are labeled with glue "colors" and "strengths." 
Two tiles that are placed next to each other interact if the glue colors on their abutting sides match, and 
they bind if the strength on their abutting sides matches with total strength at least a certain ambient 
"temperature." Extensive refinements of the aTAM were given by Rothcmund and Winfree in [14, 15]. 

Despite its deliberate over-simplification, the aTAM is a computationally expressive model in the sense 
that Winfree [21] proved it is Turing-universal in two (or more) spatial dimensions. This suggests that it is 
possible, in principle, to algorithmically direct the process of self-assembly. The aTAM has also been studied 
from the perspective of computational complexity theory. A problem that has received substantial attention 
is that of finding "small" tile sets that assemble N x N squares in the aTAM. For instance, Adleman, 

Cheng, Goel, and Huang [1] proved that N x N squares self-assemble with O ( i^g f gN ) distinct tile types, 
matching the Kolmogorov-dictated lower bound that was established in [15]. The more general problem 
of the self-assembly of arbitrary shapes in the aTAM has also been considered. Most notably, Soloveichik 
and Winfree [18] discovered a beautiful connection between the the Kolmogorov complexity of an arbitrary 
scaled shape and the minimum number of tile types required to assemble it. 

In addition to being an elegant and powerful theoretical tool, there is also experimental justification for 
the aTAM. For example, using DNA double-crossover molecules to construct tiles only a few nanometers long, 
Rothcmund, Papadakis and Winfree [16] implemented the molecular self-assembly of the well-known fractal 
structure called the discrete Sierpinski triangle with low enough error rates to achieve correct placement of 
100 to 200 tiles. Moreover, Barish, Schulman, Rothcmund and Winfree [3] have recently used Rothcmund's 
DNA origami [13] as a seed structure to which subsequent "computation" DNA tiles can attach and assemble 
computationally interesting patterns with error rates less than .2%! Note that this technique, although 
robust, is not general-purpose in the sense that all of the information about the to-be- assembled shape (or 
pattern) is encoded into the DNA tiles and origami seed. 

In fact, a central problem in algorithmic self-assembly is that of providing input to a tile assembly system 
(e.g., the size of a square, the description of a shape, etc.). In real-world laboratory implementations, as well 
as theoretical constructions, input to a tile system in the aTAM is provided via a (possibly large) collection 
of "hard-coded" seed tile types [1,3,15,18]. Unfortunately in practice, it is more expensive to manufacture 
many different types of tiles, as opposed to creating several copies of each tile type. This suggests that it 
might be advantageous to be able to provide input to a tile system without having to resort to hard-coding 
the input into a large number its own tiles. As a result, several natural generalizations of the aTAM have 
been developed in an attempt to model various types of alternative input delivery mechanisms. 

One such model is the staged self-assembly model [6,12], in which several intermediate structures are 
allowed to assemble in different test tubes before they are all mixed together to obtain the target structure. 
Demaine, Demaine, Fekete, Ishaque, Rafalin, Schweller, and Souvaine [6] proved that arbitrary shapes self- 
assemble with O(l) tile types but with a corresponding increase in the number of stages and even (in some 
cases) an increase in the scale of the target shape. Note that, in the staged self-assembly model, the input 
to a tile system is implicitly encoded in the actions of the laboratory scientist and not in the tile types 
themselves. 

Another means of providing input to a tile system is through the programming of the relative concen- 
trations of its tile types. Becker, Rapaport, and Rcmila [4] proved that by appropriately setting the relative 
concentrations of tiles, squares, rectangles and diamonds can self-assemble in an expected sense with O(l) 
tile types, but with a large (and undesirable) variance. Kao and Schweller [9] improved the aforementioned 
result by showing that it is possible to program the relative concentrations of O(l) tile types such that they 
will assemble into arbitrarily close approximations of N x N squares with high probability. Furthermore, 
Doty [7] recently showed that N x N squares self-assemble exactly with high probability using O(l) tile 
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types. 

The multiple temperature model [2, 8] is a natural generalization of the aTAM, where the temperature 
of a tile system is dynamically adjusted by the experimenter as self-assembly proceeds. Aggarwal, Cheng, 
Goldwasser, Kao, and Schweller [2] proved that the number of tile types required to assemble "thin" k x N 

rectangles can be reduced from fl ^ ^ (in the aTAM) to £1 ^ ^ f gN ) ^ ^ nc temperature is allowed to 
change but once. Subsequently, Kao and Schweller [8] discovered a clever "bit-flipping" scheme capable 
of assembling any N x N square using O(l) tile types and 0(logAT) temperature changes. Note that the 
multiple temperature model has a similar flavor to that of the staged self-assembly model in the sense that 
the input to a tile system in both models can be encoded into a sequence of laboratory operations. 

In all of the results mentioned in the previous three paragraphs, with the notable exception of [6], 
attention was focused on the problem of reducing the number of distinct tile types needed for the assembly 
of certain restricted classes of shapes such as diamonds, thin rectangles or squares. In this paper, we study 
the broader problem of reducing the number of tiles needed to assemble arbitrary finite shapes in the multiple 
temperature model. 

In particular, we exhibit two constant-size tile sets in which scaled-up versions of arbitrary shapes self- 
assemble. Our first tile set has the property that each scaled shape self-assembles via a temperature sequence 
whose length is proportional to the Kolmogorov complexity of the shape, but the scaling factor grows with 
the size of the shape being assembled. In contrast, our second tile set assembles each scaled shape via a 
temperature sequence whose length is proportional to the number of points in the shape but the scaling 
factor is a constant independent of the shape being assembled. Finally, we show that the scale factor in both 
of our constructions is necessary, i.e., that there is no constant-size tile set that can uniquely assemble an 
arbitrary shape in the multiple temperature model. This answers an open question of Kao and Schweller [8], 
who asked whether such a tile system existed. 

The remainder of this paper is organized as follows. In Section 2, we review basic definitions and 
notation for both the abstract tile assembly model and the multiple temperature model. In Section 3, 
we exhibit two constant-size tile sets in which scaled-up versions of arbitrary shapes self-assemble in the 
multiple temperature model. In Section 4, we prove that there is no general-purpose tile set capable of the 
self-assembly of arbitrary shapes in the multiple temperature model. Section 5 contains concluding remarks 
and states an open question. 

2 Preliminaries 

We work in the 2-dimensional discrete space Z 2 . Define the set U2 = {(0, 1), (1, 0), (0, —1), (—1, 0)} to be the 
set of all unit vectors, i.e., vectors of length I 'ml?. We write [X] 2 for the set of all 2-element subsets of a set 
X. All graphs here are undirected graphs, i.e., ordered pairs G = (V, E), where V is the set of vertices and 
EC [V] 2 is the set of edges. A grid graph is a graph G — (V, E) in which V CI? and every edge {a, b} G E 
has the property that a — b G 1/2- The full grid graph on a set V C 1? is the graph Gy — (V, E) in which E 
contains every {a, b} G [V] 2 such that a — b G Ui. 

A shape is a set X C 1? such that G% is connected. In this paper, we consider scaled-up versions 
of finite shapes. Formally, if X is a shape and c G N, then a c-scaling of X is defined as the set X c = 
{(x,y) G 1? I (|_f J > LcJ) e }■ Intuitively, X c is the shape obtained by replacing each point in X with a 
c x c block of points. We refer to the natural number c as the scaling factor or resolution loss. Note that 
scaled shapes have been studied extensively in the context of a variety of self-assembly systems [5,6,18,22]. 

2.1 The Abstract Tile Assembly Model 

We now give a brief and intuitive sketch of the aTAM that is adequate for reading this paper. More formal 
details and discussion may be found in [10, 14, 15, 21]. 

Intuitively, a tile type t is a unit square that can be translated, but not rotated, having a well-defined 
"side w" for each «6l/ 2 . Each side u of t has a "glue" of "color" col t (u) - a string over some fixed alphabet 
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S - and "strength" str t (u) - a nonnegative integer - specified by its type t. Two tiles t and t' that are 
placed at the points a and a + u respectively, bind with strength str t (u) if and only if (colt (u) ,str t (u)) = 
(coh/ (-u) ,str t / (-tt)). 

Given a set T of tile types, an assembly is a partial function a : Z 2 — » T, with points x <E Z 2 at 
which a(af) is undefined interpreted to be empty space, so that dom a is the set of points with tiles. An 
assembly is r- stable, where r G N, if it cannot be broken up into smaller assemblies without breaking 
bonds of total strength at least r. For an assembly a, each m £ Z 2 , and each u S J7 2 , str Q (m, u) = 
str Q („i)(-S) • [col a(7a) (5) = col a(fll+ 3)(-u) and str Q(ra) (u) = str a(A+s) (~M)] where [</>] is the Boolean value of 
the statement (the Boolean value on the right is if {m, m + u} dom a). The r- frontier of an assembly 
a, written as <9 T a is the set of all points to which a tile can be r-stably added to a. 

Self-assembly begins with a seed assembly a (typically assumed to be finite and r-stable) and proceeds 
asynchronously and nondetcrministically, with tiles absorbing one at a time to the existing assembly in any 
manner that preserves stability at all times. All of the tile assembly systems in this paper are assumed to 
have a single seed tile placed at the origin. 

A tile assembly system (TAS) is an ordered triple T = (T, a, r), where T is a finite set of tile types, a is 
a seed assembly with finite domain, and r is the temperature. An assembly sequence in a TAS T = (T, a, r) 
is a (possibly infinite) sequence a = (a, | < i < I) of assemblies in which ao = a and each Oj+i is obtained 
from oti by the "r-stable" addition of a single tile. Let a = (ao, a±, . . .) and a' = (a' , a't, . . .) be assembly 
sequences. We say that a is a prefix of a', written as a C a', if there exists j such that for every < i < j, 
cti — a^. An assembly a is terminal, and we write a 6 «4q [T], if no tile can be stably added to it. We write 
A\j [T] for the set of all terminal assemblies of T. A TAS T uniquely produces an assembly, if it has exactly 
one terminal assembly i.e., \Au [T] | = 1. 

2.2 The Multiple Temperature Model 

In the multiple temperature model, a tile assembly system is defined as an ordered triple T = (t, a, (t^^Lq 1 ^ , 

where the third component (t^^q 1 is a sequence of non-negative integer temperatures. The number k is the 
temperature complexity of T. The temperature range of a tile assembly system is the largest temperature in 
its temperature sequence. 

Throughout the remainder of this section, let T = (t, a, (t^^Lq 1 ^ be a multiple temperature tile assembly 

system. Intuitively, self-assembly in T is carried out in k phases. In the first temperature phase, tiles are 
added to the existing assembly as they normally would be in the abstract model until a To-stable terminal 
assembly is reached. In phase two, tiles can accrete to the existing assembly if they can do so with at least 
strength t\. Also, and at any time during the second temperature phase, if there is ever a cut of the assembly 
having a strength less than t\ , then all of the tiles on the side of the cut not containing the seed can be 
removed from the assembly. When a ri-stable terminal assembly is reached in phase two, phase three begins 
and proceeds in a similar fashion. This process continues through the final temperature phase in which tiles 
are added or removed with respect to the temperature Tfe_i until reaching a r^-i-stable terminal assembly. 
See Figure [T] for a specific example of the self-assembly of a bit-flip gadget [8] in the multiple temperature 
model. 

We define an assembly sequence for the i th temperature phase of T as follows. If i = 0, then an assembly 
sequence for temperature phase of T is an assembly sequence a of the tile assembly system 7o = (T, a, To). 
For i > 0, an assembly sequence for temperature phase i of T is a (possibly infinite) sequence of assemblies 
a = (o>q, cti, . . .) satisfying the following conditions. 

1. There exists I such that ai is Ti_i-stable, d Ti ~ 1 ai = and (ao, at,..., ai) is an assembly sequence for 
temperature phase i — 1 of T; and 

2. for all j > I, Oij+i is obtained from ctj by the r^-stable addition of a single tile or the deletion of a cut 
of aj - to which the seed tile does not belong - having strength less than t.; . 
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(a) Tile types (b) t = 2 (c) n = 5 (d) (e) 

Figure 1: In this example, thick notches are strength 5, and thin notches are strength 1. 



An assembly sequence in T is an assembly sequence for the i th temperature phase of T for some i 6 N. 
We say that an assembly sequence a in T finishes every temperature phase of T if a is a finite assembly 
sequence for temperature stage k — 1 of T and its final assembly, denoted a, is Tfc-i-stable and d Tk - 1 a = 0. 
In this case, we call a a terminal assembly and write a £ Arj [T]. If, for every assembly sequence a in T, 
there exists an assembly sequence a! in T such that a' finishes every temperature phase of T, a Q a' , and 
Ao [T] = {<^}i then T uniquely produces the assembly a. For a given shape X, we say that T uniquely 
produces the shape X if Au = {a} and dom a = X. 

3 Self- Assembly of Arbitrary Scaled Shapes with 0(1) Tile Types 

In this section, we exhibit two constructions that are capable of building scaled-up versions of arbitrary shapes 
in the multiple temperature model. Both constructions reduce the tile complexity for the self-assembly of 
arbitrary scaled shapes from ( j^rj^y) J [18] to 0(1), but with a corresponding increase in temperature 
complexity. 

3.1 Optimum Temperature Sequences but Unbounded Scaling Factors 

In our first construction, we simply combine a portion of the main construction of [18] with the bit-flip 
gadget of [8]. Fix some universal Turing machine U. The Kolmogorov complexity of a shape X, denoted as 
K(X), is the size of the smallest program tt that outputs an encoding of a list of all the points in X. In 
other words K(X) = min{|7r| | U(n) — (S)}. The reader is encouraged to consult [11] for a more detailed 
discussion of Kolmogorov complexity. 

Theorem 3.1. There exists a tile set T with \T\ — 0(1) such that, for every finite shape X, there exists 
c G N and a temperature sequence (n)^~Q with k = 0{K(X)) such that T x " = (t, a, (t,)^Tq J uniquely 
produces X c . 

Proof. The basic idea is to combine the tile set (of Theorem 3.1) of [8] that assembles an 11 x 2m rectangle 
(whose top row encodes the bits of an arbitrary binary string of length m) with the portion of the tile set 
from [18] that does not contain any "seed-frame" or "un-packing" tile types. Thus, given any program 7r 
that, when run by U , outputs X as a list of points, we can use bit-flip gadgets to encode a description of 
tt. Then the main construction from [18] can proceed normally at temperature t^-i = 2. The only (minor) 
technicality is that we must add additional tile types to ensure that the seed block is properly assembled, 
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(a) Partial seed block: west growing simulation of U on n (b) A completed seed block 



Figure 2: In our first construction, the seed square assembles first. Then four identical rectangles simultaneously 
assemble off each side of the seed square via the corresponding temperature sequence defined in the proof of Theorem 
3.1 in [8]. After the self-assembly of the four rectangles, the rotated 'L' structure initiates the assembly of a square 
in which the bits of 7r are rotated up and to the left into the computation space. Finally, U is simulated on 7r in the 
computation space. After the simulation is done, the shape X is encoded along the border of the seed block. The 
filler tiles are used to ensure that the seed block is a square. 

which we illustrate in Figured Note that the size of this tile set is O(l), i.e., it is independent of the shape 
being assembled, and the temperature complexity is O ( 1 7r | ) . □ 

Remark 3.2. Theorem 13. II is tight. For instance, for an algorithmically random string w = wqWi, ■ ■ ■ w n — i, 
the shape X(w) = {(x , Vo) , (xi, yi) , . . . (x n -i, Vn-i)} defined by (x ,yo) = (0,0), and (x i+ i,y i+ i) = 
(xi+i, yi + 1) if Wi = 0, and (xi + 1, yi+x) if to = 1 (i.e., a path that goes right if Wi = and up if 
wi = 1), has K{X{w)) w K(w) > n. 

Note that in the construction for Theorem 13. 1( the scaling factor can be quite large. Specifically, the 
scaling factor c depends on the running time of 7r, whence c = poly(time(n)) [18]. Also, the scaling factor 
in the above construction is further inflated (albeit by a constant factor) by the filler tiles in the assembly 
of the seed block. In a truly nano-scale setting, it is necessary to have a construction in which the scaling 
factor is always small, or better yet, bounded by a constant independent of the shape being assembled. We 
now show how to achieve this. 



G 



3.2 Constant Scaling Factor but Long Temperature Sequences 



Recall that for any scaled finite shape X c , each point in X gets mapped to a c x c block of points in X c . 
In our second construction, we will build a simple square gadget that will be responsible for the assembly of 
each c x c block in X c . As a result, the scaling factor c in our second construction will depend entirely on 
the size of the square gadgets. 
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Figure 3: Overview of square gadgets: each square gadget consists of two bit-flip gadgets. The second (a.k.a. upper) 
bit-flip gadget remembers the value of the first gadget and thus can initiate the correct change in direction. The little 
black notches on the borders of the first three square gadgets initiate the growth of another (appropriately-rotated) 
square gadget. 

Intuitively, each square gadget consists of two logical components: a lower and an upper half. The lower 
half of a square gadget is the concatenation of two bit-flip gadgets such that the second bit-flip gadget 
"remembers" the value of the first. The upper half of each square gadget then places a special output tile 
along the left, top or right side of the square depending on the values of the bit-flip gadgets in the lower half. 
Finally, the special output tile initiates the growth of another (appropriately-rotated) square gadget and the 
process is repeated for every point in the shape. Figure [3] gives an intuitive overview of the four canonical 
square gadgets. A more detailed example of the self-assembly of a square gadget is shown in Figures [B] and [7] 
in the technical appendix. 

Theorem 3.3. There exists a tile set T with |T| = O(l), such that, for every finite shape X, if there is a 
Hamiltonian path C in G%, then there exists a temperature sequence (t^JLq 1 with k — 0{\X\), such that 
T xl1 — (t,<j, {Ti)^ = Q^j uniquely produces X 11 . 

Proof. Let R = ( !? J be the standard 90° counterclockwise rotation matrix. If t is a tile type, then 



v 1 

we define R(t) — t' , such that, for all u € U2, str^/ (u) — str t (i? • u) and coif (u) = colt (R ■ u). Notice that 
t' is simply the clockwise rotation of t. Let t SC0( i be the single seed tile type defined in Figured] 
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Figure 4: The unique seed tile type i S eed- 

Let T be the set of tile types satisfying (1) t see( j £ T and (2), for every tile type t that is defined in Figure[5] 
in the technical appendix, t, R(t), R 2 (t) and R 3 (t) are all elements of T. This means that T contains four 
logical copies of the square gadget each having a different "type" of direction. As a final technical matter, 
we adjust all of the glue colors (excluding the 'A' glue color) on all of the tiles in T such that two tiles can 
bind if and only if the respective square gadgets to which they belong have the same type of direction. 

Let X be an arbitrary finite shape such that there exists a Hamiltonian path C — (vq, . . . , i^jxl— 1) 
of G* (here, C is a sequence of vertices). Let Uq — (0,1). For all 1 < i < \X\, define the unit vector 
Ui — Vi — Wj-i. Define the temperature sequence (tq, n, . . . , Tg.ijf i-i), where for each < i < \X\ — 1, 
T8i = 4, 

_ J 4 if Hi — R ■ Ui-\ or Ui = Uj_i 

T Sl +l ~ I g otherw j sej 
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T&+2 — 3, T 8 j + 3 — 8, Tgi+4 — 4, 



_ f 4 if ttj = (-ii) • or Uj = Uj_! 
Tsi+5 ~ \ 9 otherwise, 

T8i+6 = 3, and r 8i+7 = 8. Finally, let t%\ X \-& = 4, r 8 | X |_ 7 = 9, t s \ X \-g = 3, T 8 pc|-5 = 8, r 8 | X |_ 4 = 4, 
r 8|x|-3 = 9 ; r 8|x|-2 = 3, and t s \ x \-i = 8. Let f xl1 = (r,a, (Ti)^.^' -1 ^. We will now describe how the 

scaled shape X 11 self-assembles in T x ". 

Assume without loss of generality that if i — 0, then T2 = t@ = 4 (i.e., v% — vq = (0, 1)). The initial 
temperature subsequence (r , Ti, . . . r 7 ) assembles (1) an initial 11 x 11 "seed" square gadget, denoted Sq, 
from the single seed tile, and (2) the first 1 x 11 row (column) of an appropriately-rotated square gadget 
attached to the top side of the seed gadget. 

In general, for each < i < \X\ — 1, the temperature subsequence (r 8i , . . . r 8i+ 7) assembles (1) an 
appropriately-rotated 11 x 11 square gadget, denoted Si, attached to the square gadget assembled in the 
previous temperature subsequence, and (2) the first lxll row (column) of an appropriately- rotated square 
gadget attached to: the top side of Si if r 8 i+i = r 8 ^+5 = 4; the left side if r 8 i+i = 4 and t 8 ^+5 = 9; or 
the right side if r 8 ^+i = 9 and = 4. Intuitively, by our choices of r 8 i+i and T$i+5, we can force the 

"direction" of self-assembly to follow the Hamiltonian path C . Namely, we set: r 8 i + i = r 8 j + 5 = 4 in order 
to continue straight (relative to the current direction of self-assembly); r 8 i + i = 4 and r 8 i+5 = 9 to initiate a 
relative left turn; t 8 ^ + i = 9 and r 8 j + 5 = 4 to initiate a relative right turn; and rgi+i = 9 and r 8 i + 5 = 9 to 
halt self-assembly. 

Finally, we have Tgixl-7 = t"8|x|-3 = 9, and the last temperature subsequence assembles the final 11 x 11 
(halting) square gadget attached to the square gadget assembled in the previous temperature subsequence. 

Since each square gadget is an 11 x 11 square, and C is a Hamiltonian path of G x , we have that X 11 
self-assembles in T xl1 . An intuitive illustration of this process is depicted in Figures [ ^b)| and [ ^c)| Also, 
Figures [5] and [7] in the technical appendix give a detailed example of how to program a square gadget to 
turn left. A formal proof of the fact that T x uniquely produces X 11 is, although tedious, straightforward, 
and therefore omitted. □ 
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(a) Example 
shape X 



(b) Hamiltonian path 

of G " 




(c) Self-assembly of X 22 . The 'S' tile is the single seed tile. 
Notice that the first three square gadgets have the same 
type of direction, while the fourth square gadget is rotated 
90° clockwise. 



Figure 5: Overview of second construction for Theorem 13.51 

In our second construction, we will encode a Hamiltonian path of a particular finite shape X into a 
temperature sequence in order to assemble the scaled-up version of X. Unfortunately, not all shapes have 
Hamiltonian paths, which might suggest that this approach is doomed to fail. Lucky for us, however, we 
have the following technical lemma. 
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Lemma 3.4. If X is a finite shape, then there exists a Hamiltonian cycle G in G* 



x 2 



Proof. Note that, in this proof, we will think of Hamiltonian cycles as sequences of edges. For every finite 
shape X, define the set B(X) — {(x, y) G X \ (3u G U2) satisfying (x, y) + u £ X}. One can think of the set 
B(X) as the set of all points from which it is possible to "get away from" the shape X in one step. In what 
follows, we will prove that there exists a Hamiltonian cycle G in G^ 2 with the following property P: 

For every (w, x), (y, z) G B (X 2 ) with (y, z) - (w, x) G J7 2 , such that (|_|J , [§ J) = 
( [f J, L§J), the edge ((w, £),(</,*)) G G. 

Our proof is by induction on \X\. For the base case, we have \X\ = 1, and it is routine to verify that G^ 2 
has a Hamiltonian cycle G satisfying property P. 

For the inductive case, let X be a shape with \X\ = k + 1. We will show that G^ 2 has a Hamiltonian 
cycle G satisfying property P. Let x G X be an arbitrary point such that X — {x} is a shape. Then 
define the shape Y = X — {x}. Since Y is a shape with F| = fc, the induction hypothesis tells us that 
Gy 2 has a Hamiltonian cycle -D = (eo, ei, . . . e&_i) satisfying property P. We will use D to construct a 
Hamiltonian cycle G in G^ 2 having property P as follows. Let a,b,c,d G X 2 be such that |a, 6, c, = 
{(x,y) J ([|J , [|J) = x}. Since D satisfies property P, there exist points p,q£Y 2 such that, there exists 
u E U2 satisfying a + it = p, d + u = q, and (p, q) <E D. Suppose that ej = (p, q) for some < j < k. 
Finally, let G = (eo, ex, ... , ej—i, (p, a) , (a, bj , ^6, cj , (c, ctj , (d, qj , e J+ i, . . . , ek-i^- It is clear that G is a 
Hamiltonian cycle in Gf 2 having property P. □ 



Lemma 13.41 says that, if we are comfortable with doubling the scaling factor, we can always use a 
Hamiltonian cycle to direct the self-assembly of an arbitrary scaled shape. 

Theorem 3.5. There exists a tile set T with \T\ = 0(1) such that, for every finite shape X, there exists a 
temperature sequence (Tj)Jlg with k = 0(\X\), such that T x22 — (r,a, (Ti)i=o) un iq uer y produces X 22 . 



Proof. The theorem follows by Theorem 13.31 Lemma 13.41 and the simple observation that, for any finite 
shape X, (X 2 ) 11 =X 2 ' n =X 22 . □ 



4 Impossibility of Self- Assembly of Arbitrary Shapes with O(l) 
Tile Types 

At this point, a natural question might be the following: Is the scaling factor in both of our constructions 
absolutely necessary? This question can be stated formally as follows. 

Question 4.1 (Kao and Schweller [8]). Does there exist a tile set T, with \T\ = O(l), such that for every 
finite shape X, there exists a temperature sequence (t^JLq 1 such that T = ^T, cr, (t^^Lq 1 ^ uniquely produces 
XI 

In the remainder of this section, we prove that the answer to Question 14. II is "no." In other words, we 
prove that the scaling factor in each of our constructions is necessary. 

We must first, however, define some notation that we will use in our proof. If a = {a^,a\, . . . 
is an assembly sequence in T and fa G Z 2 , then the a-index of fa is is(fa) = min{i G N | fa, G dom aj}. 
That is, the a-index of fa is the time at which any tile is first placed at location fa by a. For each location 

fa G lj 0<i<; dom «i, define IN"(m) = |mG U2 is(fn + u) < i&(fa) and str a . {A) (fa,u) > j. Intuitively, 

the set IN Q (m) is the set of sides on which the first tile that of places at location fa initially binds. We now 
have the machinery to prove the following result. 
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Theorem 4.2. For every tile set T, there exists a finite shape X C Z such that for each temperature 
sequence (Ti)*_g, T = ^T, <r, (ri)*_Q^ does not uniquely produce X. 

Proof. Fix a set of tile types T, and let X = {0, ... , |T|} x {0}. We will show that, given any temperature 

sequence (t»)*_q , the tile system T = (T, a, (t,-)*_q ^ does not uniquely produce X. To get a contradiction, 

assume that T uniquely produces X and let a be the unique assembly satisfying a G A\j \T\. Since \X\ > \T\, 
there must exist x p ,x q € X such that a(x p ) = a(x q ). Since T uniquely produces X, we know that, by 
definition, every assembly sequence in T is finite. Therefore, it suffices to exhibit an infinite assembly 
sequence in T. The next fact is an easy consequence of the pigeonhole principle. 

Fact 1. If a = (ao, a\, . . . , a m _i) is an assembly sequence in T such that, for all < i < m, dom ati C X 
and dom a m -\ — X, then there exists an infinite assembly sequence a! in T. 

For each assembly sequence a = (ao, «i, ■ ■ ■ , a m -i) in T such that for some < i < m, dom a, — X =/= 0, 
let ys be the unique point ys^X such that, for all z X U {?/<;}, ?<; (ys) < is (•?)■ Intuitively, the point 
is the location of the first tile that a places at any point not in X. The next fact gives sufficient conditions 
for the existence of an infinite assembly sequence in T . 

Fact 2. Let a = (ao, ct±, . . . , a m -\) be an assembly sequence in T such that for some < i < m, dom a, — 
1^0, and u be the unique vector satisfying u € IN a (y$). If str Q . ^ a ) (ys,u) < Tk~i and, for every x E X, 



is (x) < i s (ys) a ts( i) (x) 



_ J a (x — (afg — x p )) if x — x q el 
a (x) otherwise, 



then there exists an infinite assembly sequence a' in T. 

Proof, (of FactHj) Let (xo,0) = x £ X be the location of the rightmost tile in the assembly a is ^ s y We 
can define an infinite assembly sequence a 1 = (a' , a'^, . . .) as follows. For < i < is(ys), let a[ — 
Qj. Notice that at this point, the temperature of the current temperature phase must be less than Tk-\ 
since str a4- ^ a ^ (ys,u) < Tk—i. This means that we can keep placing tiles sequentially at points in X 
(because a is r^_ 1 -stable, whence every bond between adjacent tiles in a must be at least this strong). For 
is (ys) < i < — 1) + ig (ys) — xq, let a' i+1 be the assembly obtained from a\ by placing a tile of type 
a (x + (i — is (ys) + 1,0) — (x q — x p )) at the point x+(i — is (ys) + L 0) if x+ (i — is (ys) + L 0) — x q 6 X, 
and a (x + (i — is (ys) + L 0)) otherwise. At this point, a tile has been placed at every point in X. But 
why stop there? For i > (\X\ — 1) + is (ys) — Xo — 1, let a' i+1 be obtained from a- by placing a tile of 
type a'i ((i + 1 — (is (ys) — xo — 1) , 0) — (x q — x p )) at the point (i + 1 — (is (ys) — xq — 1) , 0). A routine 
induction argument can be used to show that a' is an infinite (eventually) periodic assembly sequence in 
T. □ 

Fact 3. There exists an assembly sequence a* — (a^, a*, . . . , a^j) in T such that for some < i < m, 
dom a* - I / 8 and, for every id. 

/ — * \ /^\ f ol (x — (x„ — x v )) if x — x„ £ X 

is* (x) < is* (Vs*) =>■ Cii~,(x)( x ) = > )->\ i.i • 

w va ; m 1 ; \ a(x) otherwise. 

Fact [3] follows by the simple observations that every assembly sequence in T is finite, and until some tile 
is placed at some point outside of X, every tile must bind via a single input side. 

Let a* = (ato, a*, . . . , a* n _ 1 ) be the assembly sequence in T given in Fact [3] Suppose that the first tile 
that a* places at ys* & X binds to the single seed tile. In this case, the tile that is placed at ys* must bind 
via a single bond having strength less than Tk-i, because otherwise (if it were to bind with strength at least 
Tfc_i), it would be possible to place a tile at ys* before finishing the final temperature phase (contradicting 
the the fact that T uniquely produces X). If the first tile that a* places at ys* binds to the single seed tile 
with strength less than tk-i, then Fact Ogives us an infinite assembly sequence a' in T - a contradiction. 
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Therefore, the first tile that a* places at the point ys* cannot bind to the seed tile. Let u be the unique 
vector satisfying u 6 IN Q (ys*)- Suppose that str Qj r (g gt ) (ys*,u) > t^-i- If (ys* + u) — x q £ X, then 
it would be possible to place some tile at the point ys* + u during the final temperature phase because 
a ia*(vs*+s) (Va* + u) — a(ys* +u) (a contradiction). On the other hand, if (ys* + u) — x q 6 X, then it 
would be possible to place some tile at the point (ys* + u) — (x q — x p ) during the final temperature phase 
because a is , uj a , (ys* +u) = ot((ys* + u) — (x q — x p )) (a contradiction). Therefore, it must be that 
str a .^ (j7 a ») (y<3* ' ^ r fe-ij an( i Fact[2]gives us an infinite assembly sequence a' in T - a contradiction. □ 

5 Conclusion 

In this paper, we showed how to reduce the tile complexity for the self-assembly of arbitrary scaled shapes in 
the multiple temperature model. We developed two general purpose tile sets capable of assembling scaled-up 
versions of arbitrary finite shapes through appropriately chosen sequences of non-negative integer temper- 
atures. While our first construction assembled shapes via short (asymptotically Kolmogorov-optimum) 
temperature sequences, the scaling factor grows (unboundedly) with the size of the shape being assembled. 
In contrast, our second construction assembled shapes via long temperature sequences but with a constant 
"universal" scaling factor in the sense that it is the same for every shape. We then proved that, for every 
constant-size tile set T, there is some finite shape that T cannot uniquely produce via any temperature 
sequence, which implies the necessity of the scaling factor in both of our constructions. A natural direction 
for future theoretical research in the multiple temperature is the next question, which asks whether we can 
simultaneously optimize the two criteria of a short temperature sequence and a constant scaling factor, which 
were optimized separately in the constructions of this paper. 

Question 5.1. Does there exist a tile set T with |T| = O(l) and c £ N, such that, for every shape X, 
there exists a temperature sequence (t^JlTq 1 with k = 0(K(X)), such that T xc = (t,ct, (t^JIqJ uniquely 
produces X c l 

Acknowledgment 

This paper is a direct result of stimulating conversations with David Doty and Matthew Patitz. 



11 



References 



[1] Leonard Adleman, Qi Cheng, Ashish Goel, and Ming-Deh Huang, Running time and program size for 
self-assembled squares, STOC '01: Proceedings of the thirty-third annual ACM Symposium on Theory 
of Computing (New York, NY, USA), ACM, 2001, pp. 740-748. 

[2] Gagan Aggarwal, Qi Cheng, Michael H. Goldwasser, Ming- Yang Kao, Robert T. Schweller, and 
Pablo Moisset de Espanes, Complexities for generalized models of self-assembly, SIAM Journal on Com- 
puting 34 (2005), 1493-1515. 

[3] Robert D. Barish, Rebecca Schulman, Paul W. Rothemund, and Erik Winfree, An information-bearing 
seed for nucleating algorithmic self-assembly, Proceedings of the National Academy of Sciences (2009). 

[4] Florent Becker, Ivan Rapaport, and Eric Rcmila, Self-assembling classes of shapes with a minimum 
number of tiles, and in optimal time, Foundations of Software Technology and Theoretical Computer 
Science (FSTTCS), 2006, pp. 45-56. 

[5] Ho-Lin Chen and Ashish Goel, Error free self-assembly with error prone tiles, Proceedings of the 10th 
International Meeting on DNA Based Computers, 2004. 

[6] Erik D. Demaine, Martin L. Demaine, Sandor P. Fekctc, Mashhood Ishaque, Eynat Rafalin, Robert T. 
Schweller, and Diane L. Souvainc, Staged self-assembly: nanomanufacture of arbitrary shapes with 0(1) 
glues, Natural Computing 7 (2008), no. 3, 347-370. 

[7] David Doty, Randomized self-assembly for exact shapes, Proceedings of the 50th Annual IEEE Sympo- 
sium on Foundations of Computer Science, IEEE, 2009, to appear. 

[8] Ming- Yang Kao and Robert T. Schweller, Reducing tile complexity for self-assembly through temperature 
programming, Proceedings of the 17th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 
2006), Miami, Florida, Jan. 2006, pp. 571-580, 2007. 

[9] , Randomized self-assembly for approximate shapes, International Colloqium on Automata, Lan- 
guages, and Programming (ICALP) (Luca Aceto, Ivan Damgrd, Leslie Ann Goldberg, Magns M. Hall- 
drsson, Anna Inglfsdttir, and Igor Walukicwicz, eds.), Lecture Notes in Computer Science, vol. 5125, 
Springer, 2008, pp. 370-384. 

[10] James I. Lathrop, Jack H. Lutz, and Scott M. Summers, Strict self-assembly of discrete Sierpinski 
triangles, Theoretical Computer Science 410 (2009), 384-405. 

[11] Ming Li and Paul Vitanyi, An introduction to kolmogorov complexity and its applications (texts in 
computer science), Springer, February 1997. 

[12] John H. Rcif, Local parallel biomolecular computation, University of Pennsylvania, American Mathe- 
matical Society, 1999, pp. 217-254. 

[13] Paul W. K. Rothemund, Folding DNA to create nanoscale shapes and patterns, Nature 440, no. 7082, 
297-302. 

[14] Paul W. K. Rothemund, Theory and experiments in algorithmic self-assembly, Ph.D. thesis, University 
of Southern California, December 2001. 

[15] Paul W. K. Rothemund and Erik Winfree, The program-size complexity of self-assembled squares (ex- 
tended abstract), STOC '00: Proceedings of the thirty-second annual ACM Symposium on Theory of 
Computing (New York, NY, USA), ACM, 2000, pp. 459-468. 

[16] Paul W.K. Rothemund, Nick Papadakis, and Erik Winfree, Algorithmic self-assembly of dna sierpinski 
triangles, PLoS Biology 2 (2004), no. 12. 



12 



[17] Nadrian C. Seeman, Nucleic- acid junctions and lattices, Journal of Theoretical Biology 99 (1982), 237- 
247. 

[18] David Soloveichik and Erik Winfree, Complexity of self-assembled shapes, SIAM Journal on Computing 
36 (2007), no. 6, 1544-1569. 

[19] Hao Wang, Proving theorems by pattern recognition - II, The Bell System Technical Journal XL (1961), 
no. 1, 1-41. 

[20] , Dominoes and the AEA case of the decision problem, Proceedings of the Symposium on Math- 
ematical Theory of Automata (New York, 1962), Polytechnic Press of Polytechnic Inst, of Brooklyn, 
Brooklyn, N.Y., 1963, pp. 23-55. 

[21] Erik Winfree, Algorithmic self-assembly of DNA, Ph.D. thesis, California Institute of Technology, June 
1998. 

[22] Erik Winfree and Renat Bckbolatov, Proofreading tile sets: Error correction for algorithmic self- 
assembly., DNA (Junghuei Chen and John H. Rcif, cds.), Lecture Notes in Computer Science, vol. 
2943, Springer, 2003, pp. 126-144. 



13 



6 Technical Appendix 
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(e) (4,3,8,4) 
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(d) (4,3,8); Note that B4 binds cooperatively and thus 
"remembers" the bit of the first bit-flip gadget. 
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(f) (4,3,8,4,9); the tile in the upper-left corner has been 
flipped from to 1 while still remembering the value of the 
previous bit-flip gadget. 
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(g) (4,3,8,4,9,3) 
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(h) (4,3,8,4,9,3,8) 



Figure 6: An example of programming a square gadget to "turn left." We do this by setting the first bit-flip gadget 
to 0. Then we set the second bit-flip gadget to 1. Note that the latter bit-flip gadget remembers the value of the 
former bit-flip gadget. 
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Figure 7: (4, 3, 8, 4, 9, 3, 8); The final result of Figure[6] Note the special tile (labeled with '■*—') along the left border 
of the completed square gadget initiated the growth of (the first column of) an appropriately rotated square gadget. 
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(b) The '«— ' tile initiates the growth of an- 
other square gadget "west" of the current 
square gadget; the other tile does not ini- 
tiate any subsequent square gadgets 



(c) Tiles that initiate growth of another square gadget "east" of the current square gadget 
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(d) Tiles that initiate growth of another square gadget 
directly "north" of the current square gadget 
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(e) Tiles for the second bit-flip gadget. Here x, y £ {0, 1}, and 

/ : {0, l} 2 > {T, <-, -+, stop} is defined as follows: /(0, 0) = T, 

/(0, 1) = /(l, 0) = -v, and /(l, 1) = stop. 
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(f) Tiles for the first bit-flip gadget. They are essentially the same as the those of the bit-flip gadget in [8] 
Figure 8: Tile types for the square gadget. The thick notches represent strength 9 bonds. 
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